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FIELD OF THE INVENTION 

1. FIELD OF THE INVENTION: 

The present Invention relates to an audio 
5 information provision system for selecting and providing 
background music which is suitable to the objective and image 
of various spaces, for example, commercial spaces such as 
department stores and other types of stores, public spaces 
such as hotels and offices, or private spaces where people 
10 can enjoy themselves such as automobiles and houses. 

2. DESCRIPTION OF THE RELATED ART: 

Conventionally, background music has been provided 
to commercial spaces and public spaces mainly as follows. 

15 Music selecting specialists select background music which 
is suitable to the image of each commercial or public space, 
and the selected background music is provided in the form 
of recording media such as CDs or the like. Alternatively, 
background music channels of cable broadcasting including 

20 a wide variety of music programs are subscribed to. 

The system of having specialists select suitable 
background music uses expert knowledge and results in a high 
level of satisfaction of the users. However, this system 
25 is disadvantageously costly and thus can be accepted only 
by clients who highly appreciate the sales promoting effect 
of background music. In addition, the selected background 
music is not always suitable to the image and objective of 
the space or the type of customers or people within the space. 

30 

The use of recording media such as CDs requires the 
user to play the same background music recorded on the 
recording media for a certain period of time. 



- 2 - 



P27176 



The subscription to cable broadcasting allows the 
user only to select a music genre, and the user is also 
required to play the predetermined programs of the selected 
5 genre for a certain period of time. 

As can be appreciated from the above, it has 
conventionally been difficult to provide a user with 
background music suitable to the objective or image of each 
10 individual commercial or public space or taste of people 
present in the space in accordance with changes in time, 
environment and other conditions . 

SUMMARY OF THE INVENTION 

15 

An audio information provision system for providing 
a target with an audio information stream suitable to the 
target including a database for storing a plurality of audio 
information streams; an inherent condition input section 

20 for receiving an inherent condition of the target ; a variable 
condition input section for receiving a variable condition 
varying in accordance with time; a selection section for 
selecting at least one audio information stream from the 
plurality of audio information streams based on at least 

25 the inherent condition and the variable condition; and an 
output section for outputting the at least one audio 
information stream. 

In one embodiment of the invention, the database 
30 stores a plurality of related information streams 
respectively related to the plurality of audio information 
streams. Each of the related information streams has a 
coordinate value representing a position of the 
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corresponding audio information stream in a prescribed 
coordinate system which defines an impression of audio 
information streams. The selection section determines a 
coordinate value representing a position of the target in 
5 the prescribed coordinate system based on the inherent 
condition and the variable condition, and selects at least 
one audio information stream corresponding to at least one 
related information stream having a coordinate value which 
is located within a prescribed range from the coordinate 
10 value representing the position of the target. 

In one embodiment of the invention, at least one 
related information stream among the plurality of related 
information streams includes adjustment information which 

15 indicates that a distance between a coordinate value 
included in the at least one related information stream and 
the coordinate value of the target is adjusted, based on 
at least one of the inherent condition and the variable 
condition. The selection section changes the coordinate 

20 value included in the at least one related information 
stream. 

In one embodiment of the invention, the audio 
information provision system further comprises a reserved 
25 condition input section for receiving a reserved condition 
indicating that a preset audio information stream is output 
by the output section at a preset time. The output section 
outputs the preset audio information stream at the preset 
time. 

30 

In one embodiment of the invention, the audio 
information provision system further includes an economic 
condition input section for receiving an economic condition 
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representing a desired cost for the at least one audio 
information stream. The selection section selects at least 
one audio information stream, based on the economic 
condition, from among the at least one audio information 
5 stream selected from the plurality of audio information 
streams based on the inherent condition and the variable 
condition . 

In one embodiment of the invention, the plurality 
10 of related information streams further include a plurality 
of physical feature information streams each representing 
a physical feature of the corresponding audio information 
stream of the plurality of audio information streams and 
a plurality of bibliographical information streams each 
15 representing a bibliography of the corresponding audio 
information stream of the plurality of audio information 
streams . 

In one embodiment of the invention, the selection 
20 section is connected to the inherent condition input section, 
the variable condition input section and the output section 
via a communication line. 

In one embodiment of the invention, the target is 
25 one of a commercial space and a public space. 

In one embodiment of the invention, the target is 
an individual. The inherent condition represents inherent 
information of the individual. The variable condition 
30 represents mood information of the individual. 

In one embodiment of the invention, the audio 
information provision system further includes an economic 
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condition input section for receiving an economic condition 
representing a desired cost for the at least one audio 
information stream; a mood information analysis section for 
analyzing the mood information and outputting a mood 
information analysis result; and an individual information 
accumulation section for accumulating the inherent 
condition, the mood information analysis result and the 
economic condition. The selection section selects at least 
one audio information stream, based on the economic 
condition, from among the at least one audio information 
stream selected from the plurality of audio information 
streams based on the inherent condition and the mood 
information analysis result. 

In one embodiment of the invention, the mood 
information analysis result and the economic condition are 
accumulated in the individual information accumulation 
section as individual preference information representing 
an individual preference. The individual preference 
information is updated each time the mood information 
analysis result and the economic condition are input to the 
individual information accumulation section. 

In one embodiment of the invention, the audio 
information provision system further includes a 
satisfaction degree information input section for receiving 
satisfaction degree information representing a 
satisfaction degree of the individual for the at least one 
audio information stream. 

In one embodiment of the invention, the individual 
information accumulation section accumulates a past 
selection result provided by the selection section. The 
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audio information provision system further includes a 
feedback section for presenting to variable condition input 
section, as individual preference information representing 
an individual preference, the past selection result 
accumulated in the individual information accumulation 
section. The variable condition input section provides the 
individual with an input interface based on the individual 
preference information. 

In one embodiment of the invention, the audio 
information provision system further includes an economic 
condition input section for receiving an economic condition 
representing a desired cost for the at least one audio 
information stream; a mood information analysis section for 
analyzing the mood information and outputting a mood 
information analysis result; and an individual information 
accumulation section for accumulating the inherent 
condition, the mood information analysis result and the 
economic condition. The selection section selects at least 
one audio information stream from the plurality of audio 
information streams based on instruction information from 
a musicotherapist based on the inherent condition, the mood 
information analysis result and the economic condition. 

In one embodiment of the invention, the variable 
condition input section inputs impression information 
representing an impression of an audio information stream 
desired by the individual as the mood information. 

In one embodiment of the invention, the variable 
condition input section includes a display section. The 
variable condition input section provides the individual 
with a prescribed coordinate system which defines an 
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impression of audio information streams through the display 
section. The impression information is input to the 
variable condition input section by the individual 
specifying at least one point in the prescribed coordinate 
system. 

In one embodiment of the invention, the prescribed 
coordinate system includes a plurality of words 
representing the impression. The plurality of words are 
changed in accordance with the type of audio information 
stream desired by the individual. 

In one embodiment of the invention, the prescribed 
coordinate system has a plurality of image parts. 

In one embodiment of the invention, the impression 
is represented by at least one of a word, a color and a symbol. 

Thus, the invention described herein makes possible 
the advantages of providing a system for selecting 
background music suitable to the objective or image of 
commercial spaces such as department stores and other types 
of stores, public spaces such as hotels and offices, or 
private spaces where people can enjoy themselves such as 
automobiles and houses . 

These and other advantages of the present invention 
will become apparent to those skilled in the art upon reading 
and understanding the following detailed description with 
reference to the accompanying figures. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows an audio information provision system 
according to a first example of the present invention; 

5 

Figure 2 shows an information delivery system using 
the audio information provision system according to the 
first example and a communication line; 

10 Figure 3 shows a structure of a selection section 

of the audio information provision system according to the 
first example; 

Figure 4 shows an audio information provision system 
15 according to a second example of the present invention; 

Figures 5A, 5B and 5C each show an input interface 
used for inputting an inherent condition according to the 
first example; 

20 

Figure 6 shows an input interface used for inputting 
a variable condition according to the first example; 

Figure 7 shows an input interface used for inputting 
25 a reserved condition according to the first example; 

Figure 8 shows a structure of information stored in 
a database; 

30 Figure 9 shows a process for creating an impression 

space according to the first example; 



Figure 10 shows a process for obtaining a mapping 
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function according to the first example; 

Figure 11 shows a process for performing mapping to 
the impression space according to the first example; 

Figure 12 shows a component score coefficient for 
each space ID according to the first example; 

Figure 13 shows an impression space to which space 
IDs are mapped according to the first example; 

Figure 14 shows an impression space coordinate value 
for each space ID according to the first example; 

Figure 15 shows classification by time-and-day- 
of-the-week ID according to the first example; 

Figure 16 shows an impression space to which 
time -and- day- of -the-week IDs are mapped according to the 
first example; 

Figure 17 shows classification by season ID 
according to the first example; 

Figure 18 shows an impression space to which season 
IDs are mapped according to the first example; 

Figure 19 shows classification by weather condition 
ID according to the first example; 



Figure 20 shows classification by weather condition 
ID according to the first example; 
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Figure 21 shows adjustment Information of each 
musical genre by space ID according to the first example; 

Figure 22 shows adjustment information for related 
5 bibliographical information of each special space ID 
according to the first example; 

Figure 23 shows adjustment information for related 
bibliographical information of each special season ID 
10 according to the first example; 

Figure 24 shows a structure of a selection section 
of the audio information provision system according to the 
second example; 

15 

Figure 25 shows an audio information provision 
system according to a third example of the present invention; 

Figure 26 shows an input interface used by a user 
20 for inputting inherent information, mood information and 
economic condition to a terminal; 

Figure 27 shows mood information input by the user 
and the analysis result provided by a mood information 
25 analysis section; 

Figure 28 shows another audio information provision 
section according to the third example of the present 
invention; 

30 

Figure 29 shows an input interface used for 
inputting, to a satisfaction degree input section, whether 
the provided audio information was sampled, whether the user 
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was satisfied with the provided audio information stream, 
and the image the user has on the audio information stream 
that he/she was not satisfied with; 

5 Figure 30 shows an input interface used for 

inputting mood information to a variable condition input 
section; 

Figure 31 shows still another audio information 
10 provision system according to the third example of the 
present invention; 

Figure 32 shows an audio information provision 
system according to a fourth example of the present 
15 invention; 

Figure 33 shows an input interface of a variable 
condition input section and an economic condition input 
section used when the user demands musicotherapy ; 

20 

Figure 34 shows a coordinate system representing a 
music image chart according to a fifth example of the present 
invention; 

25 Figure 35 shows classification of representative 

adjectives by music image according to the fifth example; 

Figure 36 shows the relationship between the musical 
features and the music image parts ; 

30 

Figure 37 shows another coordinate system 
representing a music image chart according to the fifth 
example ; 
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Figure 38 shows still another coordinate system 
representing a music image chart according to the fifth 
example ; and 

5 

Figure 39 shows still another coordinate system 
representing a music image chart according to the fifth 
example . 

10 DESCRIPTION OF THE EMBODIMENTS 

Hereinafter, the present invention will be 
described by way of illustrative examples with reference 
to the accompanying drawings. 

15 

(Example 1) 

Figure 1 shows an audio information provision system 
100 according to a first example of the present invention. 
In the first example, the audio information provision system 
20 100 provides a commercial or public space with audio 
information suitable thereto as, for example, background 
music. 

The audio information provision system 100 includes 
25 a database 130 storing a plurality of audio information 
streams, an inherent condition input section 101 for 
receiving an inherent condition which is inherent to a 
commercial or public space, a variable condition input 
section 102 for receiving a variable condition which is 
30 variable in accordance with time, a selection section 120 
for selecting at least one audio information stream from 
the plurality of audio information streams based at least 
on the inherent condition and the variable condition, an 
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output section 140 for outputting the at least one audio 
information stream selected by the selection section 120, 
and a reserved condition input section 103 for receiving 
a reserved condition which indicates that a preset audio 
5 information stream is output by the output section 140 at 
a preset time. 

The audio information provision system 100 can be 
implemented in various forms, and elements of the audio 
10 information provision system 100 can be connected to each 
other in various forms. For example, each element of the 
audio information provision system 100 can be implemented 
by hardware, software, or a combination of hardware and 
software . 

15 

The selection section 120 can be connected to the 
inherent condition input section 101, the variable 
condition input section 102, the reserved condition input 
section 103, and the output section 140 through a 
20 communication line (as shown in Figure 2). 

Figure 2 shows an exemplary information delivery 
system provided by the audio information provision system 
100 using a communication line 153 according to the present 
25 invention. 

The information delivery system includes, for 
example, a terminal 151 used by a user in a commercial or 
public space 150 and a background music delivery center 154 
30 of an information service organization providing 
information to the terminal 151. In the commercial or 
public space 150, the audio information is recorded and 
reproduced by the terminal 151 and provided to the commercial 
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or public space 150 through a reproduction device 152 as 
background music. The database 130 and the selection 
section 120 are in the background music delivery center 154. 
The background music delivery center 154 manages a huge 
5 amount of audio information (for example, audio contents) 
stored in the database 130. The background music delivery 
center 154 and the terminal 151 transmit information to each 
other through a communication line 153. The communication 
line 153 can be, for example, a network, a wireless 
10 communication line or a wired communication line (for 
example, the Internet, a satellite communication line or 
a telephone line). 

The inherent condition input section 101, the 
15 variable condition input section 102, the reserved 
condition input section 103 and the output section 140 can 
be in the terminal 151. 

The terminal 151 can be, for example, a personal 
20 computer or a dedicated terminal device. 

In the case where the terminal 151 is a personal 
computer, the user can input an inherent condition, a 
variable condition and a reserved condition to the terminal 
25 151 using an input section such as a keyboard, a mouse, a 
touch pad or the like connected (wired or wireless) to the 
personal computer, while viewing a display 155. The user 
can also receive audio information from the reproduction 
device 152 connected to the personal computer. 

30 

In the case where the terminal 151 is a dedicated 
terminal device, each of the conditions can be input using 
the display 155 or the like incorporated therein. 
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An "inherent condition" refers to a condition which 
is inherent to a target to which the audio information is 
provided. An inherent condition is, for example, an image 
5 based on the product concept, building, location or type 
of customers of a commercial space such as a store. 

Figures 5A, 5B and 5C each show an exemplary input 
interface used as the inherent condition input section 101 
10 (Figure 1) used by the user to input an inherent condition 
to the terminal 151. Input interfaces 111, 112 and 113 
respectively shown in Figures 5A, 5B and 5C are displayed 
on, for example, the screen of the display 155 of the terminal 
151. 

15 

The input interfaces 111, 112 and 113 are preferably 
user-friendly input interfaces which represent the image 
of the commercial or public space 150 with words or colors. 

20 The words representing the image of the commercial 

or public space 150 shown in the input interfaces 111, 112 
and 113 are "impression representing words" , which is unique 
to an audio information provision system according to the 
present invention. The impression representing words can 

25 be selected using a mathematical technique such as factor 
analysis or principal component analysis from a plurality 
of words used by the music selecting specialists or store 
designers . 

30 The input interface 111 (Figure 5A) is designed to 

allow the user to input an image with an input bar including 
the impression representing words. The input interface 112 
(Figure 5B) is designed to allow the user to input an image 
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by specifying one point in a two-dimensional coordinate 
system including two perpendicular factors. The input 
interface 112 can have a color map including colors instead 
of the impression representing words. 

5 

The input interface 113 (Figure 5C) allows the user 
to input a specific name of a commercial or public space 
as an inherent condition. This dispenses the user from the 
need to consider the image of the store for providing the 
10 store with audio information suitable thereto. This also 
allows audio information which is certainly suitable for 
the store which it is to be provided. 

A "variable condition" refers to a condition which 
15 varies in accordance with time. The variable condition can 
vary from moment to moment and is, for example, the season, 
date, time, weather, temperature, humidity, or crowdedness . 

Figure 6 shows an exemplary input interface used as 
20 the variable condition input section 102 (Figure 1) used 
by the user to input a variable condition to the terminal 
151. An input interface 114 shown in Figure 6 is displayed 
on, for example, the screen of the display 155 of the terminal 
151. The time and date, and the day of the week are 
25 automatically displayed by the clock function of the 
personal computer or the dedicated terminal device acting 
as the terminal 151. The environmental data such as the 
weather, temperature and humidity can be automatically 
delivered by the communication function of the terminal 151. 
30 Thus, the user is dispensed from the trouble of inputting 
such data. 



A "reserved condition" refers to that a preset audio 
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information stream is output by the output section 140 
(Figure 1) at a preset time. When a reserved condition is 
input to the terminal 151, the output section 140 outputs 
the following predetermined audio information streams to 
5 the shoppers at the store at a predetermined time: for 
example, the time signal, a sound effect for promoting sales, 
a notice of time-limited discount or a notice of store 
opening or closing. The contents of the reserved condition 
(the type of audio information streams or time to output 
10 each audio information stream) can be set differently store 
to store. 

Figure 7 shows an exemplary input interface used as 
the reserved condition input section 103 (Figure 1) used 

15 by the user to input a reserved condition to the terminal 
151. An input interface 115 shown in Figure 7 is displayed 
on, for example, the screen of the display 155 of the terminal 
151. The input interface 115 allows the user to set the time 
to provide the store with the time signal or notice of store 

20 opening or store closing. The input interface 115 also 
allows the user to set the time to provide the sound effect 
for promoting sales or the notice of time-limited discount. 
The type of sound effect or the type of notice can be set 
by inputting a filename. 

25 

The database 130 stores in advance a huge amount of 
audio information streams . Figure 8 shows the form of the 
audio information streams being stored. The database 130 
stores audio information streams MU(1) through MU(n) and 
30 related information streams INFO(l) through INFO(n) 
respectively related to the audio information streams MU(1) 
through MU(n) . Related information stream INFO(n) includes , 
for example, a physical feature, an impression space 
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coordinate value, bibliographic information or the like 
regarding each audio information stream (for example, a 
song). A "physical feature" refers to the tempo, pitch, 
loudness, envelope or the like inherent to a song. A 
5 physical feature is obtained by general acoustic signal 
processing such as, for example, frequency spectrum 
analysis, Wigner analysis, autocorrelation analysis or the 
like of the song. An "impression space coordinate value" 
refers to a value in a coordinate system which represents 

10 an impression space determined in advance using the 
impression representing words inherent to the target space 
such as a commercial space. The coordinate system can 
define the impression of the audio information streams. 
"Bibliographical information" refers to the title, 

15 performer, formation of instruments, musical genre, time 
of composition of the song or the like. 

In this specification, the term "song" is defined 
to refer to a tune with or without lyrics . 

20 

Hereinafter, a method for obtaining an impression 
space coordinate value of an audio information stream will 
be described. Figure 9 shows a process for creating an 
impression space for commercial spaces. First, a plurality 

25 of words representing the impression of commercial spaces 
are extracted. From the plurality of words, representative 
pairs of impression representing words used by unspecified 
people with high frequency such as, for example, "quiet - 
lively" and "luxurious - frugal" are selected. Each of 

30 these representative pairs of impression representing words 
is determined as a representative measure. Then, using an 
impression space including coordinate axes, each for one 
representative measure, a plurality of evaluators perform 
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subjective evaluation of the images of the commercial spaces 
using a psychological evaluation technique such as the SD 
(Semantic Differential) method. The evaluation result is 
processed with a mathematical technique such as , for example , 
5 factor analysis or principal component analysis. Thus, 
from the above-mentioned impression space, another 
impression space having two coordinate axes perpendicular 
to each other is defined. Figure 13 shows an exemplary 
coordinate system which represents an impression space for 
10 the commercial spaces. 

Figure 10 shows a process for determining a mapping 
function for mapping songs, which are audio information 
streams, to the above -de scribed coordinate system which 

15 represents the impression space for the commercial spaces. 

A plurality of songs which are used as samples are processed 
with subject evaluation using the same measures and 
evaluators as used for the subject evaluation for the images 
of the commercial spaces. Then, from the analysis result 

20 and evaluation result obtained when the impression space 
for the commercial spaces is defined, an impression space 
coordinate value of each of the plurality of songs used as 
samples in the coordinate system representing the 
impression space for the commercial spaces is calculated. 

25 In parallel, physical features of the plurality of songs 
are calculated. From the relationship between the 
calculated coordinate values in the impression space and 
the calculated physical features, a mapping function for 
mapping a song to the coordinate system which represents 

30 the impression space for the commercial spaces is determined 
using a mathematical technique such as the multiple 
regression analysis or the like. Several ten to about a 
hundred songs can be used as samples for obtaining the 
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mapping function. 

Figure 11 shows a process for mapping an enormous 
number of unknown songs to the coordinate system which 
5 represents the impression space for the commercial spaces. 
Once a physical feature of an unknown song is calculated, 
an impression space coordinate value of the unknown song 
is automatically calculated based on the mapping function 
determined by the above-described process. Using the 

10 calculated impression space coordinate value, the unknown 
song can be mapped to the coordinate system which represents 
the impression space for the commercial spaces. The song 
of which the impression space coordinate value has been 
determined is stored in the database 130 as an audio 

15 information stream. The calculated physical feature and 
the calculated impression space coordinate value are also 
stored in the database 130 as related information streams . 

With reference to Figure 3, the selection section 
20 120 also shown in Figure 1 will be described. The selection 
section 120 includes an inherent condition coordinate value 
calculation section 121, a variable condition coordinate 
value calculation section 122, a bibliographical 
information evaluation value calculation section 123, a 
25 total evaluation value calculation section 124 , and an audio 
information play list creation section 125. 

The inherent condition coordinate value 
calculation section 121 analyzes the inherent condition of 
30 the commercial space which has been input to the inherent 
condition input section 101 and determines the impression 
space coordinate value suitable to the inherent condition. 
The variable condition coordinate value calculation section 
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122 analyzes the variable condition of the commercial space 
which has been input to the variable condition input section 
102 and determines the impression space coordinate value 
suitable to the variable condition. The bibliographical 
5 information evaluation value calculation section 123 
outputs adjustment information to the total evaluation 
value calculation section 124 . The adjustment information 
adjusts the probability at which an audio information stream 
relating to at least either one of the inherent condition 
10 which has been input to the inherent condition input section 
101 and the variable condition which has been input to the 
variable condition input section 102 is selected by the 
selection section 120. 

15 The total evaluation value calculation section 124 

analyzes the impression space coordinate value suitable to 
the inherent condition, the impression space coordinate 
value suitable to the variable condition, and the adjustment 
information, and selects an audio information stream from 

20 the database 130. The audio information play list creation 
section 125 analyzes the audio information stream selected 
by the total evaluation value calculation section 124 and 
the reserved condition which has been input to the reserved 
condition input section 103, and determines the order by 

25 which the plurality of audio information streams are to be 
output by the output section 140. Hereinafter, the 
operation of the selection section 120 will be described 
in more detail. 

30 Figure 12 shows a table representing results of 

subjective evaluation of the image of a plurality of 
commercial spaces . The table is used by the inherent 
condition coordinate value calculation section 121. In 



- 22 - 



P27176 



Figure 12, the space ID is used to identify a type of stores 
having a common representative image provided by the 
inherent condition input to the inherent condition input 
section 101. The subjective evaluation results are shown, 
5 for each space ID, by component score coefficients obtained 
by the SD method. For example, high quality store A 
represented by space ID 1 obtained the following score 
coefficients for the following images: (quiet, active, 
luxurious, moderate price, traditional, casual, modern, 

10 bright) = (0.5, -0.5, 0.4, -0.6, 0.9, -0.4, -0.4, 0.2). 
Figure 14 shows an example of the impression space coordinate 
values in a two-dimensional coordinate system which 
represents the impression space for the commercial spaces. 
The impression space coordinate values are obtained based 

15 on the above -de scribed results. Figure 13 shows two- 
dimensional coordinate system 160 representing the 
impression space for the commercial spaces. In the 
coordinate system 160, the impression space coordinate 
values of space IDs 1 through 7 are mapped. The center of 

20 each circle is the position represented by the impression 
space coordinate value of each of the spaces ID 1 through 
7 . Figure 13 indicates that the images of the commercial 
spaces such as department stores and other stores can be 
mapped to the coordinate system 160. 

25 

The process shown in Figure 3 can be summarized as 
follows. When the user inputs the image of a commercial 
space using, for example, the input interface 111 shown in 
Figure 5A, the component score coefficients shown in Figure 
30 12 are determined. Then, the impression space coordinate 
value is determined by the result of the principal component 
analysis. When the input interface 112 shown in Figure 5B 
is used, an impression space coordinate value is directly 
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input. When the input interface 113 shown in Figure 5C is 
used, the name of a specific commercial space input and an 
impression space coordinate value shown in Figure 14 is 
determined based on the name. The impression space 
5 coordinate value is the inherent condition coordinate value 
which is calculated by the inherent condition coordinate 
value calculation section 121. Here, the inherent 
condition coordinate value is represented as "A" . 

10 Figure 15 shows a table representing classification 

of a variable condition component of a commercial space 
regarding the time and the day of the week which has been 
input to the variable condition input section 102. The 
table is used by the variable condition coordinate value 

15 calculation section 122. In a commercial space, the type 
of customers and crowdedness are varied in accordance with 
the time and the day of the week. The time is classified 
as, for example, AM and PM, and the day of the week is 
classified as, for example, weekdays and weekend. In Figure 

20 15, each classification tag is provided with an ID (in 
parenthesis). For example, the time-and-day-of -the-week 
ID for Tuesday, 3 p.m. is WD2 . Figure 16 shows a result of 
nine time-and-day-of -the-week IDs obtained by combining the 
classification tags in Figure 15 being mapped to the 

25 coordinate system 160. It is not necessary to map all of 
the time-and-day-of -the-week IDs to the coordinate system 
160. 

Figure 17 shows a table representing classification 
30 of a variable condition component of a commercial space 
regarding the season which has been input to the variable 
condition input section 102. The table is used by the 
variable condition coordinate value calculation section 122. 
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In a commercial space, the taste and actions of customers 
are varied in accordance with the season. The season is 
classified as, for example, spring, summer, etc., or by month 
In Figure 17, each classification tag is provided with an 
5 ID. For example, the season ID for November 4 is AU2 . Since 
some events occur in a particular season, special season 
IDs are also provided. For example, a period from April 1 
to April 10 when schools hold entrance ceremonies is provided 
with the special season ID of SP2A. Figure 18 shows a result 
10 of the season IDs in Figure 17 being mapped to the coordinate 
system 160. It is not necessary to map all of the season 
IDs and all the special season IDs to the coordinate system 
160. 

15 Figure 19 shows a table representing classification 

of a variable condition component of a commercial space 
regarding the weather conditions which has been input to 
the variable condition input section 102. The table is used 
by the variable condition coordinate value calculation 

20 section 122. In a commercial space, the taste and actions 
of customers are varied in accordance with the weather 
conditions. The weather conditions are classified by the 
elements such as, for example, weather, temperature, 
humidity, discomfort index, etc. In Figure 19, each 

25 classification tag is provided with an ID. For example, 
when the weather is fine, the temperature is 15°C and the 
humidity is 30%, the discomfort index is comfortable, and 
thus the weather condition ID is F300 . The discomfort index 
Di is obtained by expression (1) from the temperature T (°C) 

30 and the relative humidity U (%) as follows: 



Di = 0.81T + 0.01U(0.99T - 14.3) + 46.3 ... (1). 



- 25 - 



P27176 



In this example, the discomfort index Di is 
classified into three stages of: comfortable, slightly 
uncomfortable and uncomfortable. Figure 20 shows a result 
of weather condition IDs obtained by combining the 
5 classification tags in Figure 19 being mapped to the 
coordinate system 160. It is not necessary to map all of 
the weather condition IDs to the coordinate system 160. 

The process performed by the variable condition 
10 coordinate value calculation section 122 shown in Figure 
3 will be further described. 

The user inputs a variable condition using the input 
interface 114 or the like shown in Figure 6 (alternatively, 

15 the time and date can be automatically set by a built-in 
function of the terminal 151, and the temperature and 
humidity can be delivered from the background music delivery 
center or the like as weather condition information which 
changes from moment to moment). Then, the time-and-day- 

20 of-the-week ID (Figure 15), the season ID or the special 
season ID (Figure 17), and the weather condition ID (Figure 
19) are determined. Thus, the impression space coordinate 
value C of the time-and-day-of -the-week ID, the impression 
space coordinate value D of the season ID (or the special 

25 season ID), and the impression space coordinate value E of 
the weather condition ID are determined. The variable 
condition coordinate value B is determined by expression 
(2) as follows: 

3 0 B = cC + dD + eE . . . (2) . 

The additions (+) performed in expression (2) 
indicate the following: when the positions represented by 
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the coordinate values C, D and E are In the same quadrant 
of the coordinate system, the variable condition coordinate 
value B is calculated so as to be at the center of the three 
positions; and when the positions represented by the 
5 coordinate values C, D and E are in different quadrants of 
the coordinate system, the variable condition coordinate 
value B is calculated by performing vector calculation of 
the coordinate values C, D and E. Weighting coefficients 
c, d and e are determined in accordance with a prescribed 
10 rule. Which of the coordinate values C, D and E is to be 
the main element to calculate the variable condition 
coordinate value B can be adjusted by giving different values 
to the weighting coefficients c, d and e. 

15 The total evaluation value calculation section 124 

shown in Figure 3 calculates a total evaluation value M, 
which is the final coordinate value of the target commercial 
space, using expression (3) based on the inherent condition 
coordinate value A and the variable condition coordinate 

20 value B. 

M = aA + bB ... ( 3 ) 

Like in expression (2), the addition (+) performed 
25 in expression (3) indicates the following: when the 
positions represented by the coordinate values A and B are 
in the same quadrant of the coordinate system, the total 
evaluation value M is calculated so as to be at the center 
between the two positions; and when the positions 
30 represented by the coordinate values A and B are in different 
quadrants of the coordinate system, the total evaluation 
value M is calculated by performing vector calculation of 
the coordinate values A and B. In expression (3), a and b 
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are weighting coefficients. 

The coordinate value represented by the total 
evaluation value M is the coordinate value of the target 
5 commercial space. Regarding the coordinate system 160, the 
total evaluation value calculation section 124 selects an 
audio information stream corresponding to a related 
information stream having coordinate values within a 
prescribed range (for example, one) from the coordinate 
10 value represented by the total evaluation value M. 

The adjustment information which is output by the 
bibliography information evaluation value calculation 
section 123 is stored in advance in the related information 

15 stream INFO(n) shown . in Figure 8. The adjustment 
information is created in advance using a meta data creation 
tool for the audio information provision system according 
to the present invention, with reference to the 
bibliographical information corresponding to the audio 

20 information stream. The adjustment information is 
determined for each space ID, each time -and- day- of -the- 
week ID, each season ID and each weather condition ID. For 
example, the adjustment information shows the following 
values: +oo when the corresponding audio information is 

25 "never selected" as the background music for the target 
commercial space, 0 when the corresponding audio 
information is "absolutely selected", 1/2 when the 
corresponding audio information is suitable, and 2 when the 
corresponding audio information is not very suitable. The 

30 adjustment information acts as a "filter for preventing 
deviation from social commonsense" so that songs such as 
"Chanson de 1' adieu" are never used in wedding reception 
houses . 
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Figure 21 shows an example of adjustment information 
for each musical genre for each space ID. Figure 22 shows 
an example of adjustment information for each special space 
5 ID. Figure 23 shows an example of adjustment information 
for each season ID. 

The process performed by the bibliographical 
information evaluation value calculation section 123 shown 
10 in Figure 3 will be further described. 

The user inputs an inherent condition and a variable 
condition using the input interfaces 111, 112, 113 and 114 
shown in Figures 5A, 5B, 5C and 6. Then, the bibliographical 

15 information evaluation value calculation section 123 
inquires to the database 130 about the adjustment 
information included in the related information stream 
INFO(n) shown in Figure 8 based on the space ID, the season 
ID, the time-and-day-of-the-week ID, the weather condition 

20 ID, and then retrieves an audio information stream related 
to the adjustment information stream corresponding to one 
of at least the inherent condition and the variable condition. 
The bibliographical information evaluation value H, which 
is the final adjustment information, is determined by 

25 expression (4) based on adjustment information F determined 
based on the bibliographical information related to the 
inherent condition of the audio information stream and 
adjustment information G determined based on the 
bibliographical information related to the variable 

30 condition of the audio information stream. 



H = f F + gG . . . ( 4 ) 
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In expression (4), f and g are weighting 
coefficients. The bibliographical information evaluation 
value H is output to the total evaluation value calculation 
section 124 . The total evaluation value calculation 
5 section 124 multiplies the distance between the coordinate 
value assigned to the above-mentioned related audio 
information stream and the coordinate value represented by 
the total evaluation value M, with the bibliographical 
information evaluation value H, so as to adjust the distance. 

10 The distance is adjusted by changing the coordinate value 
assigned to the related audio information stream. When the 
bibliographical information evaluation value H is 0, the 
distance is 0 and therefore the related audio information 
stream is necessarily selected by the total evaluation value 

15 calculation section 124. When the bibliographical 
information evaluation value H is +oo, the distance is +<» 
and therefore the related audio information stream is never 
selected by the total evaluation value calculation section 
124. 

20 

The audio information play list creation section 125 
shown in Figure 3 analyzes the audio information stream 
selected by the total evaluation value calculation section 
124 and the reserved condition, and determines the order 

25 in which the audio information streams are output from the 
output section 140 . The order can be determined in various 
manners in accordance with the objective. For example, the 
songs can be reproduced by the method of matching the tempo 
and loudness of the songs to the fluctuate curve of 1/f (f : 

30 frequency), to the human fatigue curve, or human biorhythm 
of the parasympathetic nervous system and the sympathetic 
nervous system. The audio information play list can be 
created in units of one hour or in units of one day. 
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The output section 140 can output the audio 
information in accordance with the play list output from 
the audio information play list creation section 125 shown 
5 in Figure 3. 

(Example 2) 

Figure 4 shows an audio information provision system 
200 according to a second example of the present invention. 
10 In Figure 4, identical elements as those described above 
with reference to Figure 1 bear identical reference numerals 
therewith and detailed descriptions thereof will be 
omitted. 

15 The audio information provision system 200 includes 

an economic condition input section 104 in addition to the 
elements of the audio information provision system 100 shown 
in Figure 1. An economic condition input to the economic 
condition input section 104 represents a desired cost, for 

20 example, a budget of the audio information stream to be 
provided to the target. The economic condition which is 
input to the economic condition input section 104 is output 
to the audio information play list creation section 125 as 
shown in Figure 24 . 

25 

The audio information play list creation section 125 
further selects audio information streams from the audio 
information streams selected by the total evaluation value 
calculation section 124 so that the cost is within the 
30 economic condition. From the further selected audio 
information streams and the audio information streams set 
based on the reserved condition, an audio information play 
list within the economic condition is created. For example. 
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when an upper limit of 5000 yen is provided by the economic 
condition on the audio information play list created in units 
of one day, an audio information stream is created including 
upper level songs corresponding to the budget in the audio 
5 information play list selected by the total evaluation value 
calculation section 124 so that the total cost is within 
5000 yen. 

(Example 3) 

10 Figure 25 shows an audio information provision 

system 300 according to a third example of the present 
invention. In the third example, the audio information 
provision system 300 provides an individual (or user) with 
audio information. The audio information provision system 

15 300 includes an inherent condition input section 101 for 
receiving individual inherent information which is inherent 
to an individual as described further below, a variable 
condition input section 102 for receiving mood information 
which represents the mood of the individual, and an economic 

20 condition input section 104 for receiving a desired service 
cost desired by the individual. The audio information 
provision system 300 further includes a mood information 
analysis section 31 for analyzing the mood information which 
is output from the variable condition input section 102 and 

25 then outputting the analysis result, an individual 
information accumulation section 32 for accumulating the 
individual inherent information output from the inherent 
condition input section 101, the analysis result output from 
the mood information analysis section 32 and the desired 

30 service cost output from the economic condition input 
section 104, a database 130 for storing audio information 
streams, a selection section 120 for performing a search 
in the database 130 based on the individual inherent 
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information, the analysis result, and the economic 
condition and thus providing an audio information stream 
which is suitable to the mood and the budget of the user, 
and an output section 140 for outputting the audio 
5 information stream selected by the selection section 120. 
The audio information for individuals is mainly music. 

The individual information accumulation section 32 
can be connected to the inherent condition input section 

10 101, the mood information analysis section 31, and the 
economic condition input section 104 through a 
communication line. The selection section 120 can be 
connected to the individual information accumulation 
section 32 also through a communication line. The output 

15 section 140 can be connected to the selection section 120 
through a communication line. Each communication line can 
be an arbitrary network, such as, for example, the Internet. 

In the example shown in Figure 25, the inherent 
20 condition input section 101, the variable condition input 
section 102, the economic condition input section 104, the 
mood information analysis section 31, and the output section 
140 are included in the terminal 151 (Figure 2). 

25 The individual information accumulation section 32 

can be set in a control center having an accounting 
processing function for counting the cost of the audio 
information streams provided to each user. 

30 In the example shown in Figure 25, the database 130 

and the selection section 120 can be included in a content 
holder. The content holder manages a huge amount of audio 
information streams (contents) stored in the database 130. 
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"Individual inherent information" refers to data 
which is inherent to the user. Examples of the individual 
inherent information include the name, sex, date of birth, 
5 occupation, birthplace, family structure, musical 
experience, favorite music, and credit card number of the 
user. 



"Mood information" refers to data which represents 
10 the feeling of the user. Examples of the mood information 
include (i) data which represents the state of the user 
himself /herself such as the feeling, emotion and 
psychological condition of the user, and (ii) data which 
represents the nature of music such as the mood, image and 
15 genre of the music that the user wants to listen to at a 
certain time. When the user does not know which genre of 
music that he/she wants to listen, it is not necessary to 
input the genre. It is preferable, though, to input the 
genre of the music that he/she wants to listen, in order 
20 to obtain music which is closer to the mood of the user. 



A "desired service cost" refers to the cost that the 
user is ready to pay in exchange of the audio information 
provision service. The user can input any amount of money 

25 as the desired service cost in consideration of their budget . 

The user can determine the desired service cost in accordance 
with the duration, number of songs or quality of the music 
provided. Alternatively, the user can determine the 
desired service cost in consideration of the effect provided 

30 by the music in accordance with the suitability of the music 
to his/her mood. Still alternatively, the user can 
determine the desired service cost in consideration of the 
production cost of the music that the user assumes. 
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The input interface used by the user to input the 
mood information is preferably a user- friendly input 
interface which represents the image of the music desired 
5 by the user with words or colors. 

The input interface can be a check box as shown in 
Figure 26, in addition to the input interfaces 111 (input 
bar), 112 (map) and 113 shown in Figures 5A, 5B and 5C. 

10 Figure 26 shows an input interface 116 which includes check 
boxes and allows the user to input detailed setting for 
individuals. The input interface 116 also allows the 
individual inherent information and the desired service 
cost to be input to the terminal. The input interface 116 

15 is displayed on, for example, the screen of the display 155 
of the terminal 151 (Figure 2). 

The input interface 116 including check boxes allows 
the user to input the mood information by selecting the words 
20 which represent the image of the music he/she desires. Such 
words are, for example, words representing the feelings such 
as "calm" or "cheerful and happy" , words representing a 
location such as "southern" or "seaside", or words 
representing a color such as "red" or "blue". 

25 

Using the input interface 116 shown in Figure 26, 
the user inputs the individual inherent information (for 
example, text data representing the name of the user) and 
clicks the check box of the desired type of music in the 
30 musical genre check list (for example, the check box of 
"bossa nova") . The user also clicks the check box or boxes 
of the words representing the image of the desired music 
(for example, the check boxes of "calm" , "cheerful and happy" 
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and "southern"), and inputs the desired service cost. In 
this manner, the individual inherent information is input 
to the inherent condition input section 101, the mood 
information is input to the variable condition input section 
5 102, and the desired service cost is input to the economic 
condition input section 104. In this example, the musical 
genres that the user likes and does not like can be registered 
in advance as individual inherent information. 

10 The individual inherent information and the desired 

service cost which have been input are accumulated in the 
individual information accumulation section 32 together 
with the credit card number or the like. The mood 
information which has been input is analyzed by the mood 

15 information analysis section 31. The analysis result is 
represented as values weighted by different coefficients 
for a plurality of different musical representation 
factors . 

20 Figure 27 shows an example of mood information input 

by the user and the analysis result provided by the mood 
information analysis section 31. In the example shown in 
Figure 27, the mood information is data which represents 
the image of the music desired by the user. A "musical 

25 representation factor" refers to a factor used to represent 
the nature of the music (for example, a cheerfulness factor 
or a sadness factor) . The nature of the music is represented 
by values f(l), f(2), f(m) (where m is a total number 

of musical representation factors) weighted by different 

30 coefficients for a plurality of different musical 
representation factors. 



In the following description, "values weighted by 
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different coefficients for a plurality of different musical 
representation factors" will be referred to also as an 
"analysis result using musical representation factors". 

5 The mood information is transformed into an analysis 

result using musical representation factors in compliance 
with a mood representation rule. The mood representation 
rule is defined, in advance, by a table which transforms 
an image of music into values of musical representation 
10 factors by a psychological technique such as the SD method 
or the mult i -dimensional scaling. 

The mood information analysis section 31 outputs the 
analysis result using musical representation factors to the 
15 individual information accumulation section 32. 

The individual information accumulation section 32 
accumulates the individual inherent information, the 
analysis result of the mood information and the desired 

20 service cost as described above, and also sends information 
representing a selection condition (i.e., the desired 
service cost and the analysis result using musical 
representation factors) to the selection section 120. The 
analysis result of the mood information and the desired 

25 service cost are accumulated in the individual information 
accumulation section 32 as at least a part of individual 
preference data which represents the taste of the user. The 
individual preference data is updated each time the analysis 
result of the mood information and the desired service cost 

30 are input. 

The selection section 120 performs a search in the 
database 130 based on the desired service cost and the 
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analysis result using musical representation factors. 

The structure of the audio information streams 
stored in the database 130 is similar to that shown in Figure 
5 8. 

Referring to Figure 8, the related information 
streams INFO(l) through INFO(n) each include 
bibliographical information (title of the song, performer, 

10 musical genre, formation of instruments, or the like), the 
result of analysis of the audio information streams using 
musical representation factors (i.e., values g(l), 
g(2), g(m) (where m is a total number of musical 

representation factors) weighted by different coefficients 

15 for a plurality of different musical representation 
factors), and a basic provision cost. 

The "basic provision cost" refers to a basic cost 
which is calculated based on copyright managing cost , 
20 production cost and the like. 

The analysis results using musical representation 
factors included in the related information streams of the 
database 130 are obtained by analyzing the audio information 
25 streams in a method similar to the method used by the 
information analysis section 31. 

The selection section 120 calculates a sum S of the 
absolute values of the differences between the analysis 
30 results using musical representation factors provided by 
the mood information analysis section 31 (i.e., the values 
f(l). f(2), f(m) weighted by different coefficients 

for a plurality of different musical representation 
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factors) and analysis results using musical representation 
factors included in the related information streams of the 
database 130 (i.e. , the values g(l) , g(2) , . . . , g(m) weighted 
by different coefficients for a plurality of different 
5 musical representation factors) in accordance with 
expression (5) . The above-mentioned sum S will be referred 
to as a "difference S", hereinafter. 

S = I |f (i) - g(i) | (i = 1, 2 m) ... (5) 

10 

The selection section 120 outputs audio information 
streams corresponding to the related information streams, 
as the selection result. The audio information streams are 
output in the order starting from the audio information 

15 stream corresponding to the smallest difference S. As the 
selection result, a single audio information stream can be 
output, or a plurality of audio information streams can be 
output. The number of audio information streams which are 
output as the selection result is determined in a manner 

20 described below. 

The selection section 120 adds an adaptation cost 
to the basic provision cost of each audio information stream. 
The "adaptation cost" is obtained by multiplying the basic 

25 provision cost by an adaptation ratio R. The adaptation 
ratio R increases as the difference S is smaller (i.e., the 
accuracy of the selection result with respect to the audio 
information stream demanded by the user is higher) . It 
should be noted that the upper limit of the adaptation ratio 

30 R is specified. Alternatively, the upper limit of the 
adaptation ratio R can be automatically determined based 
on the number of audio information streams provided as the 
selection result, the basic provision cost, and the desired 
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service cost within a range of, for example, ±10% (the margin 
can be freely determined by the music providing side, for 
example, the content holder). 

5 The number of audio information streams which are 

output as the selection result is determined in accordance 
with the desired service cost . Audio information streams 
are output until the grand total of the total costs exceeds 
the desired service cost. The total cost of each audio 
10 information stream is the sum of the basic provision cost 
and the adaptation cost. In this manner, at least one audio 
information stream is output as the selection result. Even 
an identical song may cost differently to different 
individuals when the adaptation ratio is different. 

15 

Table 1 show an exemplary selection result provided 
by the selection section 120. In this example, the desired 
service cost is 500 yen, and the upper limit of the adaptation 
ratio R is 25%. 

20 

Table 1 





Music file 
No. 


Basic 
provision 
cost (yen) 


Difference 
S 


Adaptation 
ratio R ( % ) 


Provision 
cost (yen) 


1. 


#00011 


100 


0.11 


25 


125 


2. 


#03770 


100 


0.19 


20 


120 


3. 


#00462 


200 


0.25 


15 


230 


4. 


#09944 


150 


0.26 


10 


165 



The total cost of three songs counted from the song 
25 corresponding to the smallest difference S is 475 yen, which 
is less than the desired service cost of 500 yen. The total 
cost of four songs counted from the smallest difference S 
is 640 yen, which exceeds the desired service cost of 500 
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yen. Therefore, the selection section 120 outputs the upper 
three songs (i.e., music file numbers #00011, #03770 and 
#00462) . 

5 Due to such a system, according to the audio 

information provision system 300, even an identical audio 
information streams is purchased at different costs by each 
individual . 

10 The audio information stream output from the 

selection section 120 is provided to the user through the 
output section 140. 

It is preferable to adopt a system of allowing the 
15 user to listen to the audio information stream for a 
prescribed time period (for example, 45 seconds) free of 
charge so that the user feeds back to the audio information 
provision system whether the user is satisfied with the 
provided audio information stream. 

20 

Figure 28 shows an audio information provision 
system 400 having such a feedback function. In Figure 28, 
identical elements as those described above with reference 
to Figure 25 bear identical reference numerals therewith 
25 and detailed descriptions thereof will be omitted. 

The audio information provision system 400 includes 
a satisfaction degree information input section 105 and a 
feedback section 36 in addition to the elements shown in 
30 Figure 25. 

The satisfaction degree information input section 
105 is structured so that the user can input information 
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indicating whether the user is satisfied with the provided 
audio information stream. 

More specifically, the user can sample the provided 
5 audio information stream and then input satisfaction degree 
information, which indicates whether the user is satisfied 
with the provided audio information stream, to the 
satisfaction degree information input section 105. When 
the user inputs information indicating that "he/she is 
10 satisfied with the provided audio information stream" to 
the satisfaction degree information input section 105, such 
information is provided to the individual information 
accumulation section 32. 

15 It is preferable that the individual information 

accumulation section 32 notifies the accounting section to 
bill the user only when it has received the information 
indicating that "the user is satisfied with the provided 
audio information stream". Thus, the user is not billed 

20 until the user is satisfied. 

When the user inputs information indicating that 
"the user is not satisfied with the provided audio 
information stream" to the satisfaction degree information 

25 input section 105, such information is provided to the 
individual information accumulation section 32. In this 
case, it is preferable that the user also inputs the image 
he/she has on the audio information stream that he/she is 
not satisfied with, to the satisfaction degree information 

30 input section 105. Thus, the satisfaction degree of the 
user (or how much the provided audio information stream 
matches the mood of the user and the budget) can be fed back 
to the audio information provision system 400. 
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Figure 29 shows an exemplary input interface 117 
used to input, to the satisfaction degree information input 
section 105, (i) whether the provided audio information was 
5 sampled or not, (ii) whether the user was satisfied with 
the provided audio information stream, and (iii) the image 
the user has on the audio information stream that he/she 
was not satisfied with. The input interface 117 is 
displayed on, for example, the screen of a display of the 
10 terminal (not shown). 

Using the input interface shown in Figure 29, the 
user inputs the image he/she has on the audio information 
stream that he/she was not satisfied with, to the 

15 satisfaction degree information input section 105. In the 
example shown in Figure 29, the images of the music are 
represented by the same words as those used in the input 
interface 116 shown in Figure 26. These images of the music 
are transformed into values weighted by different 

20 coefficients for a plurality of different musical 
representation factors (analysis result using musical 
representation factors). 

The analysis result using musical representation 
25 factors which has been input to the satisfaction degree 
information input section 105 is sent to the individual 
information accumulation section 32. 

The individual information accumulation section 32 
30 updates the analysis result using musical representation 
factors and also outputs the updated selection condition 
to the selection section 120. By updating the analysis 
result using musical representation factors accumulated in 
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the individual information accumulation section 32, the 
precision of the analysis result using musical 
representation factors improves as the same user continues 
to use the audio information provision system 400 over time. 
5 As a result, the individual adaptability to that user is 
improved . 

The selection section 120 performs another search 
in the database 130 based on the updated selection condition. 

10 

In this manner, the satisfaction degree of the user 
(or how much the provided audio information stream matches 
the mood of the user and the budget) can be fed back to the 
audio information provision system 400. 

15 

Returning to Figure 28, the individual information 
accumulation section 32 accumulates the past selection 
results in addition to the individual inherent information, 
mood information, and the desired service cost. 

20 

The feedback section 36 refers to the past selection 
results accumulated in the individual information 
accumulation section 32 as individual preference data and 
notifies the individual preference data to the variable 
25 condition input section 102. 

The variable condition input section 102 includes 
a plurality of input interfaces. The variable condition 
input section 102 is designed to provide the user with an 
30 input interface corresponding to the individual preference 
data notified by the feedback section 36 among the plurality 
of input interfaces. 
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Figure 30 shows an exemplary input interface 118 
used to input mood information to the variable condition 
input section 102. The input interface 118 show in Figure 
30 is displayed on, for example, the screen of a display 
5 of the terminal (not shown). 

In the example shown in Figure 30, it is assumed that 
an audio information stream in the musical genre of bossa 
nova was provided to the user in an immediately previous 
10 selection. The genre of the audio information stream 
provided to the user in the immediately previous selection 
can be found by referring to the past selection results 
accumulated in the individual information accumulation 
section 32. 

15 

The feedback section 36 refers to the past selection 
results accumulated in the individual information 
accumulation section 32, and controls the input interface 
in the variable condition input section 102 based on the 
20 past selection results. As a result, as shown in Figure 30, 
check boxes for Latin music genres similar to bossa nova 
obtained in the immediately previous selection (for example, 
Afro-Cuban, salsa, rumba or the like) are added. 

25 The variable condition input section 102 can have 

an input interface usable to input information representing 
musical elements (for example, rhythm, key, tempo, beat and 
the like) . When the user has knowledge of music, the user 
can input mood information using the input interface 

30 representing the musical elements. Thus, mood information 
having a higher adaptability can be input. 



Figure 31 shows an audio information provision 
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system 500. In Figure 31, identical elements as those 
described above with reference to Figure 25 bear identical 
reference numerals therewith and detailed descriptions 
thereof will be omitted. 

5 

The audio information provision system 500 includes 
an audio information processing section 37 in addition to 
the elements shown in Figure 25. 

10 The audio information processing section 37 

transforms information representing musical elements (for 
example, rhythm, key, tempo, beat and the like) into a file 
format on the database 130 and sends the transformed 
information to the selection section 120. The selection 

15 section 120 selects and outputs audio information streams 
as described above. 

(Example 4) 

Figure 32 shows an audio information provision 
20 system 600 according to a fourth example of the present 
invention. In Figure 32, identical elements as those 
described above with reference to Figure 25 bear identical 
reference numerals therewith and detailed descriptions 
thereof will be omitted. 

25 

When the user inputs individual inherent 
information, mood information and a desired service cost 
in the hope of obtaining specialized mus ico therapy , the 
input data is sent to the individual information 
30 accumulation section 32 in the control center. The control 
center accumulates the input data in the individual 
information accumulation section 32 and, when necessary, 
sends the input data to an individual information 
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accumulation section 39 in a musicotherapy association to 
which musicotherapists are registered. The data sent the 
musicotherapy association is accumulated in the individual 
information accumulation section 39. The individual 
5 information accumulation section 39 can be connected to the 
individual information accumulation section 32 through an 
arbitrary type of communication line. 

In this case, the variable condition input section 
10 102 provides the user with an input interface which is 
similar to a medical examination questionnaire in which the 
user is to describe his/her physical and mental states . The 
economic condition input section 104 provides the user with 
an input interface which allows the user to select a time 
15 period and a cost of one session. 

Figure 33 shows an input interface 119 provided by 
the variable condition input section 102 and the economic 
condition input section 104 when the user demands 
20 musicotherapy. 

A musicotherapist analyzes the data accumulated in 
the individual information accumulation section 39 based 
on expert knowledge and inputs the analysis result (for 

25 example , data which indicates what type of music is suitable ) 
to a music information processing section 38. The music 
information processing section 38 is included in, for 
example, the terminal 151 (Figure 2). The musicotherapist 
generally has knowledge that, for example, "the first 

30 movement of Mozart's Symfonia Concertante is effective to 
an insomniac". Therefore, the musicotherapist inputs 
instruction information that "look for Mozart's Symfonia 
Concertante and songs similar thereto" to the music 
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Information processing section 38 in order to provide the 
insomniac with a suitable audio information stream. 

The music information processing section 38 
5 performs acoustic signal analysis such as frequency 
spectrum analysis, Wigner analysis, autocorrelation 
analysis or the like of the designated song, and thus 
extracts musical physical features such as the tempo, pitch, 
loudness, envelope, sound features and the like. Then, the 
10 music information processing section 38 sends these musical 
physical features as an instruction information processing 
result to the selection section 120. The selection section 
120 can be connected to the music information processing 
section 38 through an arbitrary type of communication line. 

15 

Based on the instruction information processing 
result, the selection section 120 performs a search in the 
database 130. The selection section 120 selects and outputs 
audio information streams as described above. 

20 

Such a service can select and provide audio 
information streams which are suitable to various states 
and various types of mood of the user at a cost desired by 
the user. 

25 

In the information communication society of today, 
an enormous number of people have physical and mental stress . 
The audio information provision system 600 in the fourth 
example can select and provide music which is suitable to 
30 each feeling or each physical and mental state so as to 
encourage and heal these people. Especially, songs for 
musicotherapy have conventionally been selected based on 
knowledge from psychiatric counselors and therapists. 



- 48 - 



P27176 



According to the audio information provision system 600 of 
the present invention, a great number of songs suitable to 
the physical and mental states of patients can be easily 
selected and provided in a short time period. 

5 

(Example 5) 

Figure 34 shows a coordinate system 700 which is a 
music image chart in a fifth example of the present invention. 
The coordinate system 700 is provided to the user as an input 

10 interface provided by the variable condition input section 
102 of the above -described examples. The coordinate system 
700 is displayed on, for example, the screen of the display 
155 (Figure 2). The coordinate system 700 represents an 
impression space used for locating images of each individual 

15 audio information stream belonging to pop and popular music 
genres on a two-dimensional coordinate system. Coordinate 
axes 701 are representative factor axes for the pop and 
popular music genres. The vertical axis is a still-active 
axis, and the horizontal axis is a rhythm-melody axis. 

20 Reference numeral 702 represents image parts which indicate 
how images of the pop and popular music are classified. When 
an individual specifies at least one point in the coordinate 
system 700, impression information representing the 
impression of an audio information stream is input to the 

25 variable condition input section 102. 

Each image part includes the following adjectives 
(A) through (I) which are used by the sampler of an audio 
information stream for representing images of the music. 

30 



(A) Adjectives representing calmness: calm, mild, 
carefree, ingenuous, soft 

(B) Adjectives representing degree of 



- 49 



P27176 



sentimentality: romantic, sentimental, deep, dramatic 

(C) Adjectives representing naturalness: natural, 
stable, neutral, monotonous, simple 

(D) Adjectives representing light-f ootedness : 
5 light-footed, refreshing, clear-cut 

(E) Adjectives representing curiousness: 
mysterious, unique, curious 

(F) Adjectives representing dynamicalness : 
vigorous, high-spirited, dynamic, vital, active, pop 

10 (G) Adjectives representing tenseness: sharp, 

tense, exciting, cool, tight 

(H) Adjectives representing intensiveness : 

violent, sweltering, powerful, energetic, wild, noisy, 

lively, boisterous, electric, mechanical, dashing 
15 (I) Adjectives representing sophistication: 

danceable, urban, stylish, sophisticated 

The images of each audio information stream are 
associated with musical features such as the tempo, 
20 frequency characteristics, formation of instruments, pitch, 
fluctuation of tempo, and the like. Therefore, all the 
genres of pop and popular music are mapped on the music image 
chart by the classification by the musical features and the 
classification of the image. 

25 

The coordinate system is created and mapping is 
performed in basically the same process as that described 
in the first example. Hereinafter, a process for creating 
the coordinate system and the performing of mapping will 
30 be described. 

The adjectives representing images of music, the 
image parts and the representative factor axis are 
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determined using the following psychological techniques . 
First, a psychological technique referred to as the sound 
source description selection method is used. The sound 
source description selection method selects representative 
5 adjectives, representing audio information streams, from 
language data which unspecified people associate to images 
perceived when actually sampling the audio information 
streams. Consideration of frequency of use of the 
adjectives and semantic associations of the adjectives with 

10 the images is included in the sound source description 
selection method. Then, a psychological technique such as 
the SD method or the like is used to perform multiple- stage 
evaluation of the images of the audio information streams. 
As the multiple- stage evaluation, five-stage or seven-stage 

15 evaluation is typically used. From the result of the 
multiple- stage evaluation, psychological feature of each 
audio information stream is obtained. Representative 
factors are determined by factor analysis such as principal 
component analysis or the like. Representative factors are 

20 selected from the representative adjectives so that the 
total of the evaluated suitability degree of each adjective 
is equal to or greater than 75%. When two factors amount 
to less than 75%, three factors are used as representative 
factors. On the two-dimensional plan having the 

25 representative factor axes as X and Y axes , the psychological 
feature is mapped. Thus, the image chart is created. 
Figure 35 shows classification of adjectives which are 
described highly frequently regarding pop and popular 
music. 

30 

Figure 36 shows the relationship between the musical 
features of audio information streams and image parts. In 
the case of audio information stream 1, the tempo is 140, 
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the amplitude is large in a low frequency range and a high 
frequency range, the average pitch is 2500 Hz, and the 
fluctuation of the tempo is 1/f 2 . In this case, the image 
part is Hh. In the case of audio information stream 2, the 
tempo is 70, the amplitude gradually decreases as the 
frequency increases, the average pitch is 800 Hz, and the 
fluctuation of tempo is 1/f. In this case, the image part 
is Bb. Such classification of the image part can be made 
by an expert familiar with music or by automatic processing. 
Once all the genres of pop and popular music are mapped on 
the music image chart, the music image chart can be used 
as an input interface for searching for audio information 
streams providing a certain image or for selecting songs. 
Thus, songs having a particular image associated to the 
user can be more likely and intuitively retrieved or 
selected. 

The music image chart can be used in order to present 
search results or song selection results. 

Figure 37 shows a coordinate system 710 which is an 
exemplary modification of the coordinate system 700. In the 
coordinate system 710 shown in Figure 37, colors associated 
with words are used for the image parts. 

Figure 38 shows a coordinate system 720 which is an 
exemplary modification of the coordinate system 710 shown 
in Figure 37 . In the coordinate system 720 shown in Figure 
38, the shapes and colors of coordinate points representing 
audio information streams mapped on the image parts can be 
altered in accordance with the image. For example, the 
coordinate point representing the audio information stream 
on an image part 721 representing the degree of 
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sentimentality can be a heart-shaped symbol and pink. The 
coordinate point representing the audio information stream 
on an image part 722 representing the tenseness can be a 
triangular symbol and black. Alternatively, the size of a 
5 coordinate point can be changed in accordance with the 
strength of the image. On the same image part, a coordinate 
point representing an audio information stream giving a 
stronger image can be larger (see, for example, an image 
part 723). Thus, the user can intuitively associate an 
10 audio information stream with an image. 

Figure 39 shows a coordinate system 730 which is 
another exemplary modification of the coordinate system 700 
shown in Figure 34. The coordinate system 730 shows an 

15 exemplary image chart created for classical music. The 
shapes and sizes of the image parts and the adjectives 
representing the images of the music are changed to be 
suitable to classical music from those used in the coordinate 
system 700 for pop and popular music. By using the 

20 adjectives representing images of the music which are more 
suitable to the type of audio information streams desired 
by each individual, an image of the audio information streams 
desired by the individual is more easily obtained. 

25 According to the present invention, an audio 

information stream is selected from a plurality of audio 
information streams based on the inherent condition of the 
target to which an audio information stream is to be provided 
and the variable condition which changes in accordance with 

30 time. Thus , an audio information stream fulfilling both of 
the inherent condition and the variable condition can be 
provided. 
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According to the present invention, music which is 
suitable to the objective, image, change in accordance with 
time and change in accordance with environment of commercial 
spaces , public spaces and private spaces where people can 
5 enjoy themselves can be selected by a simple method in a 
short time period. Thus, the cost which is conventionally 
quite high by reliance on specialists can be reduced, and 
music suitable to each listener can be provided from a wide 
variety of selections. 

10 

According to the present invention, use of a music 
image chart as an input interface provided by the variable 
condition input section allows the user to intuitively 
retrieve and select audio information streams which are best 
15 suited to his/her mood. 

Various other modifications will be apparent to and 
can be readily made by those skilled in the art without 
departing from the scope and spirit of this invention. 
20 Accordingly, it is not intended that the scope of the claims 
appended hereto be limited to the description as set forth 
herein, but rather that the claims be broadly construed. 



